Markov decision process - PDFSEARCH.IO - Document Search Engine

Markov decision process
Results: 537

#	Item
491	research highlights doi:[removed][removed] Technical Perspective: The Ultimate Pilot Program. By Stuart Russell and Lawrence Saul. Source URL: www.cs.berkeley.edu. Language: English. Date: 2010-05-29 04:46:06. Tags: Markov processes; Stochastic control; Reinforcement learning; Markov decision process; Apprenticeship learning; Statistics; Machine learning; Dynamic programming
492	Writing Stratagus-playing Agents in Concurrent ALisp. Bhaskara Marthi, Stuart Russell, David Latham. Department of Computer Science, University of California, Berkeley, CA 94720. {bhaskara,russell,latham}@cs.berkeley.edu. Source URL: www.cs.berkeley.edu. Language: English. Date: 2005-05-26 13:17:00. Tags: Mathematical sciences; Cybernetics; Reinforcement learning; Q-learning; Markov decision process; Algorithm; Machine learning; Thread; Intelligent agent; Statistics; Artificial intelligence; Applied mathematics
493	Selecting Computations: Theory and Applications. Nicholas Hay and Stuart Russell. Computer Science Division, University of California, Berkeley, CA 94720. Source URL: www.cs.berkeley.edu. Language: English. Date: 2012-10-04 09:08:48. Tags: Decision theory; Markov models; Mathematical optimization; Dynamic programming; Reinforcement learning; Markov decision process; Markov chain; Value of information; Variance; Statistics; Probability and statistics; Markov processes
494	Programmable Reinforcement Learning Agents. David Andre and Stuart J. Russell. Computer Science Division, UC Berkeley, CA[removed]dandre,russell @cs.berkeley.edu. Source URL: www.cs.berkeley.edu. Language: English. Date: 2006-01-06 14:54:10. Tags: Markov processes; Stochastic control; Reinforcement learning; Markov decision process; Q-learning; Phạm; Machine learning; Subroutine; Reinforcement; Statistics; Computer programming; Dynamic programming
495	RAPID: A Reachable Anytime Planner for Imprecisely-sensed Domains. Emma Brunskill. Computer Science Department, University of California, Berkeley, Berkeley, CA. Source URL: www.cs.berkeley.edu. Language: English. Date: 2010-06-12 13:34:38. Tags: Stochastic control; Partially observable Markov decision process; Automated planning and scheduling; Markov decision process; Reinforcement learning; Bayesian network; Observable; State space; Action; Statistics; Dynamic programming; Markov processes
496	Concurrent Hierarchical Reinforcement Learning. Bhaskara Marthi, Stuart Russell, David Latham. Computer Science Division, University of California, Berkeley, CA 94720. {bhaskara,russell,latham}@cs.berkeley.edu. Source URL: www.cs.berkeley.edu. Language: English. Date: 2005-04-15 21:04:50. Tags: Concurrency control; Concurrency; Reinforcement learning; Markov decision process; Thread; Q-learning; Monitor; Deadlock; SARSA; Statistics; Computing; Mathematics
497	State Abstraction for Programmable Reinforcement Learning Agents. David Andre and Stuart J. Russell. Computer Science Division, UC Berkeley, CA[removed]fdandre,[removed]. Source URL: www.cs.berkeley.edu. Language: English. Date: 2008-01-03 13:48:15. Tags: Mathematical optimization; Dynamic programming; Equations; Operations research; Systems engineering; Reinforcement learning; Q-learning; Markov decision process; Bellman equation; Statistics; Systems theory; Control theory
498	Q-Decomposition for Reinforcement Learning Agents. Stuart Russell @.. Andrew L. Zimdars @.. Source URL: www.cs.berkeley.edu. Language: English. Date: 2003-06-03 00:44:40. Tags: Systems theory; Equations; Mathematical optimization; SARSA; Q-learning; Markov processes; Stochastic control; Reinforcement learning; Markov decision process; Statistics; Control theory; Dynamic programming
499	Multi-armed Bandit Problems with Dependent Arms. Sandeep Pandey. Source URL: www.cs.cmu.edu. Language: English. Date: 2007-06-18 15:15:12. Tags: Stochastic optimization; Decision theory; Markov decision process; Gittins index; Statistics; Machine learning; Multi-armed bandit
500	Journal of Artificial Intelligence Research[removed]472 Submitted[removed]; published[removed]. Source URL: www.cs.tufts.edu. Language: English. Date: 2008-03-25 22:30:40. Tags: Dynamic programming; Model theory; Markov processes; Stochastic control; Boolean algebra; Markov decision process; Reinforcement learning; Function; Propositional variable; Mathematics; Statistics; Logic